Multi-task Learning for Gender and Age Prediction on Chinese Microblog
Abstract
The demographic attributes of gender and age play an important role in social media applications. Previous studies on gender and age prediction mostly focus on engineering effective features, which is labor-intensive. In this paper, we propose a multi-task convolutional neural network (MTCNN) model that predicts gender and age simultaneously on Chinese microblogs. MTCNN reduces the burden of feature engineering and learns representations that are common to both tasks as well as unique to each. Experimental results show that our method significantly outperforms the state-of-the-art baselines.
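The idea of one convolutional encoder feeding two prediction tasks can be illustrated with a small forward-pass sketch. This is not the authors' architecture: the class name `MultiTaskCNN`, the hard sharing of a single filter bank, the three age buckets, and all layer sizes are assumptions chosen for illustration, and the weights are random rather than trained.

```python
import math
import random


def conv_max_pool(embedded, filters, width):
    """Slide each filter over token windows, apply ReLU, then max-pool over time."""
    pooled = []
    for f in filters:
        best = 0.0  # starting at 0 folds the ReLU into the max
        for start in range(len(embedded) - width + 1):
            window = [x for vec in embedded[start:start + width] for x in vec]
            best = max(best, sum(w * x for w, x in zip(f, window)))
        pooled.append(best)
    return pooled


def linear(features, weights):
    """Multiply a weight matrix (list of rows) by a feature vector."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]


def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def rand_matrix(rng, rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class MultiTaskCNN:
    """Hypothetical model: shared embedding and filters, one output head per task."""

    def __init__(self, vocab_size, emb_dim=4, n_filters=6, width=2,
                 n_gender=2, n_age=3, seed=0):
        rng = random.Random(seed)
        self.width = width
        self.embeddings = rand_matrix(rng, vocab_size, emb_dim)
        self.filters = rand_matrix(rng, n_filters, width * emb_dim)  # shared
        self.gender_head = rand_matrix(rng, n_gender, n_filters)     # task-specific
        self.age_head = rand_matrix(rng, n_age, n_filters)           # task-specific

    def forward(self, token_ids):
        embedded = [self.embeddings[t] for t in token_ids]
        shared = conv_max_pool(embedded, self.filters, self.width)
        gender_probs = softmax(linear(shared, self.gender_head))
        age_probs = softmax(linear(shared, self.age_head))
        return gender_probs, age_probs


def joint_loss(gender_probs, age_probs, gender_label, age_label):
    """Sum of the two cross-entropy terms (equal task weights are an assumption)."""
    return -math.log(gender_probs[gender_label]) - math.log(age_probs[age_label])


model = MultiTaskCNN(vocab_size=10)
gender_probs, age_probs = model.forward([1, 4, 2, 7])  # made-up token ids
loss = joint_loss(gender_probs, age_probs, gender_label=0, age_label=2)
```

Because both heads read the same pooled feature vector, minimizing the summed loss would push the shared filters toward features useful for both tasks, while each head keeps its own weights.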
Similar resources
Personalized Microblog Sentiment Classification via Multi-Task Learning
Microblog sentiment classification is an interesting and important research topic with wide applications. Traditional microblog sentiment classification methods usually use a single model to classify the messages from different users and omit individuality. However, microblogging users frequently embed their personal character, opinion bias and language habits into their messages, and the same ...
An Empirical Study on Chinese Microblog Stance Detection Using Supervised and Semi-supervised Machine Learning Methods
Nowadays, more and more people are willing to express their opinions and attitudes on microblog platforms. Stance detection is the task of judging whether the author of a text is in favor of or against a given target. Most of the existing literature addresses debates or online conversations, which have adequate context for inferring the authors’ stances. However, for detecting...
Profiling Microblog Authors using Concreteness and Sentiment - Know-Center at PAN 2016 Author Profiling
The PAN 2016 author profiling task is a supervised classification problem on cross-genre documents (tweets, blog and social media posts). Our system makes use of concreteness, sentiment and syntactic information present in the documents. We train a random forest model to identify gender and age of a document’s author. We report the evaluation results received by the shared task.
Rules-based Chinese Word Segmentation on MicroBlog for CIPS-SIGHAN on CLP2012
In this evaluation, we have taken part in the task of the Word Segmentation on Chinese MicroBlog. In this task, after analysing the feature of the MicroBlog and the result of our original Chinese word segmentation system, four Optimization Rules are proposed to optimize the segmentation algorithm for Chinese word segmentation on MicroBlog corpora. The optimized segmentation system is based on c...
Overview of NLPCC Shared Task 4: Stance Detection in Chinese Microblogs
This paper presents the overview of the shared task, stance detection in Chinese microblogs, in NLPCC-ICCPOL 2016. The submitted systems are expected to automatically determine whether the author of a Chinese microblog is in favor of the given target, against the given target, or whether neither inference is likely. Different from regular evaluation tasks on sentiment analysis, the microblog te...